Generation of aggregatable dimension information within a multidimensional enterprise software system

ABSTRACT

Techniques are described for generating dimension information in hierarchical form. A computer-implemented system, for example, includes a data store having multidimensional data. The multidimensional data includes a data cube having a dimension. The system further includes an executable software module that accesses the data store and generates hierarchy data describing a hierarchy of the dimension of the data cube. The software module generates the hierarchy data so that the hierarchy data is guaranteed to aggregate to totals represented within the data cube.

TECHNICAL FIELD

The invention relates to software systems and, in particular, tocomputing environments for enterprise business planning.

BACKGROUND

Enterprise software systems are typically sophisticated, large-scalesystems that support many, e.g., hundreds or thousands, of concurrentusers. Examples of enterprise software systems include financialplanning systems, budget planning systems, order management systems,inventory management systems, sales force management systems, businessintelligent tools, enterprise reporting tools, project and resourcemanagement systems and other enterprise software systems.

In many situations, a user may wish to publish data from one enterprisesoftware system to other third party software tools. As one example, theuser may wish to publish data from a financial planning system toreporting and analysis software. However, many enterprise softwaresystems, such as financial planning systems, store data inmultidimensional data cubes. It is often difficult to publish data fromthe multidimensional environment of the enterprise software system toreporting software, which typically stores data in a relationaldatabase. In other words, the multidimensional nature of the enterprisesoftware system is often incompatible with the two-dimensionalrelational format utilized by the reporting software.

For example, multidimensional data cubes consist of multiple dimensionsand measures. In general, a dimension is a structural attribute of adata cube that is an organized hierarchy of categories. For example, ageography dimension might include levels for country, region, state orprovince, and city. Measures represent the data values along the cellsof the dimension.

In some situations measures within a multidimensional data cube varywith data type and formatting along the dimensions of the data cube. Forexample, a defined measure may vary from a string data type for certaincells along a dimension to a numerical data type for different cellsdepending on the different dimensions of the data cube. Consequently, itis often difficult to publish the data cube and correctly represent theformat and data type of the measures.

As a result, many conventional enterprise systems published themultidimensional data in a simple text format. However, this preventsthe reporting software from being able to perform further calculationsand analysis on the published data.

SUMMARY

In general, the invention is directed to techniques for publishingmultidimensional data from an enterprise software system. The techniquesmay, for example, publish multidimensional data to a relational databaseschema that is optimized for reporting purposes. For example, thedatabase schema may be a relational star schema as described herein.

The techniques may include processes for automatically producing thedatabase schema based on the organization of the multidimensional datacube, and for populating the database schema with data from the datacube. The database schema may be used for reporting the multidimensionalenterprise data, or may serve as a staging area to move the enterprisedata toward a data warehouse within an enterprise software system.

The described techniques may further include a process for automaticallygenerating a reporting model from the database schema. The reportingmodel serves as a framework from which reports can easily be created foraccessing and presenting the multidimensional enterprise data publishedto the database schema. Further, techniques are described forautomatically regenerating the reporting model from the database schema,and synchronizing the regenerated reporting model to include any userchanges applied to the previous reporting model.

In one embodiment, a computer-implemented system comprises a data storehaving multidimensional data. The multidimensional data includes a datacube having a dimension. The system further includes an executablesoftware module that accesses the data store and generates hierarchydata describing a hierarchy of the dimension of the data cube. Thesoftware module generates hierarchy data so that the hierarchy data isguaranteed to aggregate to totals represented within the data cube.

In another embodiment, a computer-implemented method comprises storingmultidimensional data that includes a dimension having hierarchicalmembers, and generating hierarchy data that describes the hierarchicalmembers of the dimension in a form that is guaranteed to aggregate tototals represented within the data cube.

In another embodiment, a computer-readable medium comprisesinstructions. The instructions cause a programmable processor to accessmultidimensional data that includes a dimension having hierarchicalmembers, and generate hierarchy data that describes the hierarchicalmembers of the dimension. The hierarchy data defines a hierarchy ofnodes that represent members of the dimension, and the nodes includes aset of sum nodes associated with calculations that are summations ofother nodes and a set of root nodes associated with calculations otherthan only summations.

The techniques may provide one or more advantages. For example, thetechniques provide for the automatic creation of a database schema, suchas a relational star schema, that can accept heterogeneous data typesand heterogeneous formats that may be utilized within the data cube.Moreover, the schema may automatically be configured to store themetadata required for interpreting the contained enterprise data. Inthis way, other software applications, such as reporting tools, mayreadily utilize the database schema and the enterprise data containedtherein for analysis and reporting.

The details of one or more embodiments of the invention are set forth inthe accompanying drawings and the description below. Other features,objects, and advantages of the invention will be apparent from thedescription and drawings, and from the claims.

BRIEF DESCRIPTION OF DRAWINGS

FIG. 1 is a block diagram illustrating an example computing environmentin which a plurality of users interact with an enterprise planningsystem.

FIG. 2 is a block diagram illustrating one embodiment of a remotecomputing device for interacting with the enterprise planning system ofFIG. 1.

FIG. 3 is a block diagram that illustrates certain software componentsof the remote computing device in further detail.

FIG. 4 is a diagram illustrating one embodiment of the relationalschema, including a relational star schema for storing metadatapertaining to the data cubes and metadata tables pertaining to theproduced schema.

FIG. 5 is a diagram illustrating one embodiment of the dimension tablesof the relational star schema.

FIG. 6 is an overview of an example process for producing and populatinga database schema, and for publishing reports from the database schema.

FIG. 7A show an example of a simple summary dimension hierarchy, andFIG. 7B depicts a corresponding simple summary hierarchy produced by aschema generator during publication for storage within a simple sumstable.

FIG. 8A shows an example of a non-simple summary dimension hierarchyincluding a leaf node with multiple parents, and FIG. 8B depicts theresulting simple summary hierarchy produced by schema generator duringpublication for storage within the simple sums table.

FIG. 9A shows an example of a non-simple summary dimension hierarchy,and FIG. 9B depicts the resulting simple summary hierarchy produced byschema generator during publication for storage within the simple sumstable.

FIG. 10A shows an example of a non-simple summary hierarchy including asub-hierarchy of a non-simple summary, and FIG. 10B depicts theresulting simple summary hierarchy produced by schema generator duringpublication for storage within the simple sums table.

FIG. 11 shows an example of a fact table created for one data cube bythe process shown in FIG. 6.

FIG. 12 is a flowchart illustrating an example process for generating areporting model.

FIG. 13 is a flowchart illustrating an example process for updating areporting model.

FIG. 14 is a screen illustration of an example user interface with whicha user interacts to initiate publication of multidimensional data.

FIG. 15 is a screen illustration of an example user interface with whicha user selects one or more dimensions for any of the available datacubes.

FIG. 16 is a screen illustration of an example user interface with whicha user may select different options to create columns.

FIG. 17 is a screen illustration of an example user interface producedby the schema generator.

FIG. 18 is a screen illustration of an example user interface of themodel generator.

FIG. 19 is a screen illustration of another example user interface ofthe model generator.

FIG. 20 is a screen illustration of another example user interface ofthe model generator.

FIG. 21 is a screen illustration of another example user interface ofthe model generator.

DETAILED DESCRIPTION

FIG. 1 is a block diagram illustrating an example enterprise 4 having acomputing environment 10 in which a plurality of users 12A-12N(collectively, “users 12”) interact with an enterprise planning system14. In the system shown in FIG. 1, enterprise system 14 iscommunicatively coupled to a number of computing devices 16A-16E(collectively, “computing devices 16”) by a network 18. Users 12interact with their respective computing devices 16 to access enterpriseplanning system 14.

For exemplary purposes, the invention is described in reference to anenterprise planning system, such as an enterprise financial or budgetplanning system. The techniques described herein may be readily appliedto other software systems that utilize multidimensional data, includingother large-scale enterprise software systems. Examples of otherenterprise software systems include order management systems, inventorymanagement systems, sales force management systems, business intelligenttools, enterprise reporting tools, project and resource managementsystems, and other enterprise software systems.

In general, enterprise planning system 14 enables and automates thereconciliation of top-down targets with detailed bottom-up forecasts foran enterprise. Enterprise planning system 14 implements and manages anenterprise planning process, which generally consists of threefunctions: (1) modeling, (2) contribution and (3) reconciliation.

Initially, high-level enterprise managers or executives, referred to asanalysts, define organizational targets and build planning models forthe enterprise. The analysts may include, for example, financialanalysts, such as the chief financial officer, senior financial analystsor product and sales analysts. More specifically, the analysts develop amodel having a number of hierarchically arranged nodes representingvarious cost centers within the organization, such as business units ordepartments. The analysts then specify corporate target data for eachnode of the organizational hierarchy. Corporate target data may includefinancial data, revenue data, order data, inventory data, and the like,depending on the particular enterprise planning activity being carriedout by the enterprise. The analysts then assign one or more enterpriseusers 12 to each node, such as managers, supervisors, salesrepresentatives, lab managers, or the like, that are responsible forenterprise planning for the cost center corresponding to the node. Eachenterprise user 12 may be designated as a contributor that providesplanning data to enterprise planning system 14, a reviewer that acceptsor rejects contributions from the contributors, or both. Thecontributors and reviewers may be authorized users within the enterpriseor within other entities coupled to network 18, such as suppliers orcustomers.

The enterprise users 12 that are designated as contributors interactwith enterprise planning system 14 to input detailed forecasts in theform of contribution data. As described above, enterprise users 12 mayprovide detailed financial forecasts, revenue forecasts, orderforecasts, inventory forecasts, estimated resource requirements, and thelike, depending on the particular enterprise planning activity beingcarried out by the enterprise.

Enterprise planning system 14 automates the reconciliation of theforecast data with the corporate target data provided by the analysts.In particular, enterprise planning system 14 operates in accordance witha defined model, i.e., the enterprise planning model created by theanalysts, to provide a hierarchical planning process having multiplereconciliation levels. As each of the contributors provides his or hercontribution data (referred to generally, as “enterprise data”),enterprise planning system 14 automatically aggregates the contributiondata across the enterprise in real-time, and provides access to theaggregated data to enterprise users 12 designated as reviewersassociated with higher levels of the enterprise. In particular, uponreceiving contribution data from the contributors, enterprise planningsystem 14 identifies all higher levels of the organizational modelaffected by the newly received contribution data, and calculates newaggregate totals at each level in real-time.

Consequently, the reviewers view aggregated data across the enterprisein real-time during the enterprise planning session. At each level,enterprise planning system 14 ensures that the reviewers, as defined bythe nodes of the enterprise model, reconcile the target data with theforecast data. Each of the reviewers may, for example, reject or acceptthe contribution data in view of corporate targets provided by theanalysts. This process continues until the contribution data isultimately approved by the highest level of the organizationalhierarchy, thereby ensuring that the contribution data from thecontributors reconciles with corporate targets provided by the analysts.

In this manner, enterprise planning system 14 may provide more accurateenterprise planning than with conventional techniques. For example,enterprise planning system 14 may improve the accuracy andpredictability of enterprise planning by enabling organizations toreconcile corporate models and organizational targets with detailedforecasts. The techniques may provide a platform that deliverscollaborative, real-time planning capabilities, without requiringoffline consolidation and aggregation of forecasts. Because enterpriseplanning system 14 can aggregate contribution data in real-time, allusers 12 can be presented with an accurate, up-to-date view of thenumbers. Further, the architecture of enterprise planning system 14 canreadily scale to thousands of users, and may be designed around bestplanning practices. In addition, the techniques enable highparticipation by enterprise users 12, i.e., the contributors andreviewers, allowing accurate planning cycles to be reduced.

Enterprise users 12 may use a variety of computing devices to interactwith enterprise planning system 14 via network 18. For example, anenterprise user may interact with enterprise planning system 14 using alaptop computer, desktop computer, or the like, running a web browser,such as Internet Explorer™ from Microsoft Corporation of Redmond, Wash.Alternatively, an enterprise user may use a personal digital assistant(PDA), such as a Palm™ organizer from Palm Inc. of Santa Clara, Calif.,a web-enabled cellular phone, or similar device.

Network 18 represents any communication network, such as a packet-baseddigital network like the Internet. In this manner, system 10 can readilyscale to suit large enterprises. Enterprise users 12 may directly accessenterprise planning system 14 via a local area network, or may remotelyaccess enterprise planning system 14 via a virtual private network,remote dial-up, or similar remote access communication mechanism.

Enterprise planning system 14 may utilize a “cut-down” process by whichthe multidimensional data store is “sliced” for each user 12 inaccordance with the defined enterprise model. During this process,enterprise planning system 14 identifies areas of the defined model towhich users 12 are assigned, either as contributors or reviewers, and“slices” the data store based on the assignments. When a given user 12logs in and proceeds with an enterprise planning activity, enterpriseplanning system 14 communicates the respective data slice to therespective computing device 16 for display to the user via the extendedspreadsheet application. In this fashion, enterprise planning system 14need not communicate the entire model to each of users 12, therebyreducing communication time as well as resource requirements. Instead,each user 12 receives only relevant information. Users 12 interact withcomputing devices 16 to capture contribution data, and to reconcile thecontribution data with organizational targets.

As described herein, enterprise planning system 14 automaticallyproduces a database schema for publishing or otherwise outputting themultidimensional data to a relational database. Enterprise planningsystem 14 produces the database schema based on the organization of themultidimensional enterprise planning data being published. As describedin further detail below, the database schema may be a relational starschema that is optimized to store the multidimensional data in arelational form.

Upon creating the database schema, enterprise planning system 14populates the database schema with the multidimensional planning data.Other software applications may then utilize the published planningdata. For example, the database schema may serve as a staging area tomove the enterprise data to a data warehouse.

As another example, the database schema may be used for generatingreports 17 based on the multidimensional enterprise data. As describedfurther, enterprise planning system 14 may automatically generate areporting model from the database schema. The reporting model serves asa framework from which reports 17 may easily be produced from themultidimensional enterprise data published to the database schema.Enterprise planning system 14 automatically regenerates the reportingmodel from the database schema, and synchronizes the regeneratedreporting model to include any user changes applied to the previousreporting model.

FIG. 2 is a block diagram illustrating one embodiment of a computingdevice 16A, including various software modules executing thereon, whenoperated by a user 12A, such as a contributor or a reviewer. In theexemplary embodiment, computing device 16A includes web browser 20,calculation engine 22, and one or more data cubes 24. In addition,computing device 16A includes publish module 26, schema generator 30,model generator 32, and report tool 34.

In one embodiment, calculation engine 22 comprises a forward calculationengine 22 wrapped in an Active X object built in an array-basedlanguage. In the example of enterprise planning, user 12A may interactwith web browser 20 to enter and manipulate budget or forecast data.Data cube 24 contains planning data, which may include top-down targetsand bottom-up contribution data. Calculation engine 22 and data cube 24allow all calculations for an enterprise planning session to beperformed locally by computing device 16A. Therefore, in this example, acontributor can modify his or her respective contribution data, andperform calculations necessary for the enterprise planning processwithout necessarily accessing enterprise planning system 14. In otherwords, calculation engine 22 and data cube 24 may be maintained locally(e.g., as ActiveX components) via computing device 16A.

User 12A may save the planning data locally, and submit the planningdata to enterprise planning system 14 for aggregation with the planningdata from other users. Enterprise planning system 14 automaticallyaggregates the contribution data across enterprise 4 in real-time, andprovides access to the aggregated data to reviewers associated withhigher levels of the enterprise. This process continues until thecontribution data is ultimately approved by the reviewers associatedwith the highest level of the organizational hierarchy, thereby ensuringthat the contribution data from the contributors reconciles withcorporate targets. In other embodiments, calculation engine 22 and datacube 24 may be maintained only at enterprise planning system 14 andinstalled locally upon computing devices 16.

In general, publish module 26 represents a software module forpublishing multidimensional data from one or more data cubes 24. A user,such as user 12A, interacts with publish module 26 to initiate a publishprocess. During this process, user 12A selects one or more dimensionsfrom one or more data cubes 24. Schema generator 30 automaticallyproduces a database schema to store the selected multidimensional datain relational database form. In particular, schema generator 30 analyzesdata cubes 24 and the dimensions that compose the cubes to determinewhich relational tables need to be created within the database schema.

Model generator 32 automatically generates a reporting model based onthe database schema. Reporting tool 34 outputs reports 17 to present thepublished multidimensional data in accordance with the reporting model.

FIG. 3 is a block diagram that illustrates certain software componentsof the remote computing device in further detail. As illustrated in FIG.3, publish module 26 represents a software module by which a user, suchas user 12A, publishes multidimensional data from one or more data cubes24. In particular, schema generator 30 automatically produces a databaseschema 36 to store the multidimensional data in relational databaseform.

In general, database schema stores two forms of metadata. The first typeincludes metadata pertaining to data cubes 24. In particular, themetadata defines the data types and formats for measures within datacubes 24. As a result, database schema 36 supports heterogeneous datatypes and heterogeneous formats. The second type of metadata stored bydatabase schema 36 is metadata that describes the schema itself. Thissecond type of metadata may allow for enhanced interpretation ofdatabase schema 36 by a developer or an automated tool. As one example,user 12A may interact with model generator 32 to subsequently modify themetadata to add members or levels to the published dimensions, changesecurity settings applied to the published multidimensional data orperform other modifications.

To allow calculations to be performed on the heterogeneous data, schemagenerator 30 may generate database schema 36 to store all of the datatypes represented by each measure within data cubes 24. For example,schema generator 30 may create multiple columns for each measure. As oneexample, schema generator 30 may create three columns for each measure:a first column to store float values, a second column to store datevalues, and a third column to store text values for the measure. Publishmodule 26 populates database schema 36 with the multidimensional data.When publishing a particular value for a measure, publish module 26determines the data type and stores the value in the appropriate columnof database schema 36.

Model generator 32 automatically generates base reporting model 40 basedon database schema 36. Model generator 32 may further allow user 12A tomodify base reporting model 38 to produce user reporting model 38. Userreporting model 38 may, for example, define additional calculations orformat attributes for use by reporting tool 34 when generating reports17 to present the published multidimensional data. Model generator 32may maintain activity log 22 to record the modifications to base model40 in order to subsequently regenerate user reporting model 38.

FIG. 4 is a diagram illustrating one embodiment of database schema 36.In the illustrated embodiment, database schema 36 includes a metadataregion 50 for storing metadata and a data region 52 for storing thepublished multidimensional data.

In the example of FIG. 4, schema generator 30 automatically organizesdata region 52 in the form of a relational star schema for each datapublication. Data region 52 is referred to as a “star schema” becausethe entity-relationship diagram of this schema resembles a star, asillustrated in FIG. 4, with “points” of the star radiating from acentral table. In particular, the center of the star consists of a largefact table 68, and the points of the star are dimension tables 62A-62N(“dimension tables 62”).

For each publication of multidimensional data from data cubes 24,database schema 36 updates metadata region 50 and generates a new starschema. Consequently, each publication is characterized by a star schemahaving a very large fact tables 68 that contain the primary information,(i.e. data cube keys and measures), and a number of smaller dimensiontables 62. Dimension tables 62 may be viewed as lookup tables, each ofwhich contains information about the dimension members for a particulardata cube in the fact table. As one example, dimension A may representgeographical sales regions, dimension B may represent products,dimension C may represent time, and dimension D may represent versions.

In general, metadata region 50 stores metadata pertaining to publisheddata cubes 24. In particular, the metadata defines the data types andformats for measures within any of data cubes 24 that have beenpublished. Metadata region 50 also stores metadata that describesdatabase schema 36 itself. This metadata may allow for enhancedinterpretation of database schema 36 by a developer or an automatedtool. As one example, reporting tool 34 and model generator 32 mayutilize the metadata describing database schema 36 for enhancedinterpretation and reporting of the published multidimensional data.

In this example, metadata region 50 includes an application object table64, an application column table 66 and a dimension formats table 63.Application object table 64 contains metadata that describes thedifferent dimension tables 62 and fact tables 68 automatically createdby schema generator 30 to publish the selected multidimensional data. Inparticular, each row of application object table 64 contains metadatafor a different multidimensional object that was published and, morespecifically, the dimension tables 62 that store the multidimensionalobject. Table 1 lists exemplary columns for one embodiment ofapplication object table 64. TABLE 1 Column Description objectnameContains the name of the table used to store the model objects, such ascubes and dimensions. displayname Contains the display name, as seen byusers 12 in the enterprise application, of the model objects, such as acubes and dimensions. objectid Contains a global unique identifier(GUID) of the model object. objecttypeid Contains an object typeidentifier that identifies which type of table this row describes.datastore- Contains a data store object type identifier that identifiestypeid what type of database object this row describes for example, adatabase TABLE or a database VIEW. objectversion Specifies a version ofthe enterprise software from which the object was created. lastsavedContains the timestamp of when the object was last published. Thiscolumn is optional. libraryid Contains an identifier for the enterprisesoftware library from which this object was created. This column isoptional.

Application column table 66 contains metadata that describes theindividual columns of the different dimension tables 62 and fact tables68. Specifically, each row of application column table 66 describes arespective column of dimension tables 62 or fact tables 68. As a result,application column table 66 will contain multiple rows for each row inapplication object table 62. Table 2 lists exemplary columns for oneembodiment of application column table 66. TABLE 2 Column Descriptionobjectname Contains the name of the model object's table. Theobjectnames are the same names found in the objectname column of theapplicationobject table. columnname Contains the name of the columncontained in the table used to publish an object. displayname Contains adisplay name, as seen by users in the enterprise application, toassociate with the column contained in the table used to publishplanning objects. This is essentially a readable name for the columninstead of the name used in a database system that may have limitationsfor certain characters. columnid This is a GUID associated with thecolumn contained in the table used to publish a planning objectobjecttypeid Contains the objecttype id of the table containing thiscolumn. columntype- Contains an independent data type identifier forthis id column. This identifier can be, for example, TEXT_VALUE orFLOAT_VALUE. columnorder Contains the order of this column in the tableused to publish a planning object. This is used for ordering the displayof columns. logicaldata- Contains an RDBMS independent type identifierfor this type column.

Dimension formats table 63 contains metadata that describes the datatype and formatting information of measure columns of the different facttables 68. Specifically, each row of dimension formats table 63describes the data type of the measure column and the attributes of itsformat such as the scale of a numeric value. Table 3 lists exemplarycolumns for one embodiment of dimension formats table 63. TABLE 3dimension The GUID of the dimension which contains the formatted guidcolumn. itemid The GUID of the column which has a data type and format.formattype The data type of the column. For example, numeric, data orpercentage. negativesign The symbol to use to represent negative valuessymbol noofdecimal The number of decimal places (or precision) todisplay. places scalingfactor The scaling factor of a numeric value.zerovalue The characters to use to represent values which are charsequal to 0.

FIG. 5 illustrates an example organization of database schema 36 for asingle dimension, e.g., dimension 62A in this example. When formingdatabase schema 36, schema generator 30 examines the dimensions thatcompose the cubes and exports the dimension information in three forms.

First, schema generator 30 creates item tables 70 that list of all themembers of all the dimensions being published. In particular, itemtables 70 provide a flat list of the dimensions with no hierarchyinformation. In one embodiment, each row of item tables 70 is capable ofstoring a member name, a caption, a global unique identifier (guid), aninteger identifier and a display order within the dimension. Item tables70 may be used to generate reports 17 for displaying all of the data inthe cube without needing to recreate any calculations with the report.In other words, all the members of the dimension from leaf members toroot members are present within item tables 70.

Next, schema generator 30 generates simple sum tables 72. Simple sumtables 72 contain dimension information in hierarchical form. Inparticular, simple sum tables 72 provide a dimension hierarchy that isguaranteed to aggregate to the correct totals represented within thedata cube. In other words, reporting tool 34 may apply summationoperations of the lower levels of the hierarchy when generating reports17 and the same totals will be realized as the totals within the datacube. Consequently, reporting tool 34 may utilize simple sum tables 72to recreate certain calculations within the data cube. This allowsreports 17 to manipulate the sums and perform further analysis on thepublished data. In order for schema generator 30 to produce simple sumtables 72 in a manner that is guaranteed to provide correct totals, theschema generator may remove some members of the dimension as describedherein with respect to FIGS. 7A-10B.

Finally, schema generator 30 generates calculated hierarchy tables 74that contain complete dimension information in hierarchical form. Inother words, the calculated hierarchy is represented to give as muchinformation as possible. However, the dimension hierarchy is notguaranteed to be aggregatable. In particular, calculations withinreports 17 may not necessary provide the same totals since reportingtool 34 likely has a more limited calculation engine than enterpriseplanning system 14.

FIG. 6 is a flowchart that illustrates exemplary operation of thesoftware components illustrated in FIG. 3. Initially, a user, such asuser 12A, interacts with publish module 26 to initiate a publicationprocess. In particular, publish module 26 provides a user interface bywhich user 12A identifies one of data cubes 24 and selects one or moreof the dimensions of the identified data cube for publication (80). Asdescribed in further detail below, publish module 26 may analyze theselected data cubes and automatically provide user 12A with a defaultdimension for publish per data cube. During this process publish module26 may automatically remove dimensions which are not good candidates forpublication. The publish process determines this “best” dimension forpublication as follows. First, publish module 26 identifies all of thedimensions in the selected cubes for which the modeler sets formattinginformation for the data. If only one dimension in a give cube hasformats, publish module 26 select that dimension as the defaultdimension for publication for the respective data cube. If two or moredimensions in any given cube have formats, publish module 26 selects thedimension having the lowest assigned priority of calculation, where thecalculations with the lowest priority are executed first. If thedimensions of the data cube have equal calculation priority, publishmodule 26 selects the first dimension as the default dimension forpublication for the data cube.

Next, publish module 26 invokes schema generator 30 which automaticallyproduces database schema 36 to store the multidimensional data inrelational database form (82). During this process, schema generator 30creates a central fact table 68 for database schema 36 and one or moredimension tables 62 for each dimension being published. For eachdimension, schema generator 30 creates one or more of an item table 70,a simple sum table 72 or a calculated hierarchy table 74 depending uponthe desires of user 12A.

Further, schema generator 30 stores metadata in data cube metadata table64 that describes the selected data cube and constituent dimensionsbeing published. Schema generator also stores metadata in schemametadata table 66 that describes database schema 36. Schema generator 30may be a separate software application from publish module 26 or may beone or more software routines (e.g., dynamic link libraries) callable bypublish module 26.

After creation of database schema 36, publish module 26 accesses theselected data cube 24 and retrieves multidimensional data to populatethe database schema (84). As described above, publish module 26populates the corresponding fact table 68 of the newly created databaseschema 36 with keys and measures of the data cube being published.Publish module 26 then populates the dimension tables 62 for eachdimension being published. For each dimension, publish module 26populates one or more of an item table 70, a simple sum table 72 or acalculated hierarchy table 74 depending upon the desires of user 12A.

Next, model generator 32 automatically generates base reporting model 40based on the newly created database schema 36 (86) and creates userreporting model 38 by importing the definitions contained in the basereporting model. Model generator 32 allows user 12A to enhance userreporting model 38 (e.g., by defining layout and formatting attributes).Model generator 32 maintains activity log 22 to record the modificationsto user reporting model 38 (88). In this manner, model generator 32 mayreapply the changes to regenerate user reporting model 38 in the eventdatabase schema 36 and base reporting model 32 are subsequently changed.

In response to input from user 12A, reporting tool 34 outputs reports 17to present the published multidimensional data in accordance with theenhanced user reporting model 38 (90).

FIGS. 7A-10B graphically illustrates an example process by which schemagenerator 30 generates simple sum tables 72 of dimension tables 62. Thesimple sum tables 72 contain dimension members in a hierarchical form.With respect to simple sum tables 72, schema generator 30 generatessimple hierarchies used to describe simple parent-child relationshipsbetween nodes. Each row of a simple sum table 72 represents a leafmember and, more specifically, each row represents a complete path froma root of the dimension hierarchy to a leaf member.

As discussed briefly above, in order for schema generator 30 to producesimple sum tables 72 in a manner that is guaranteed to provide correcttotals, schema generator 30 may reorganize the nodes in the hierarchy toensure that they are aggregatable. By guaranteeing that the informationis aggregatable, report tool 34 may apply summation operations of thelower levels of the hierarchy and the same totals will be realized asthe totals within the data cube. Consequently, report tool 34 mayutilize simple sum tables 72 to recreate certain calculations within thedata cube. This allows reports 17 to manipulate the sums and performfurther analysis on the published data.

In general, schema generator 30 generates a simple sum hierarchy from adimension based on a set of rules. First, schema generator 30 scans eachdimension item (member) and uses the item's associated mathematicalexpression to identify the item's parent. The parent of an item is thefirst simple sum that references the item as an input. In the case wherethere are multiple candidates for the parent of a node, the node isassigned to the first parent in model order and the other candidateparents are considered to be leaf nodes in the hierarchy. The modelorder refers to the order in which items have been added to the model.In the case where a parent cannot be identified using the two previousrules and the node is not a simple sum, the item is considered to be aroot node (referred to as orphan node in the rest of the text). Leafnodes are simply the end nodes as defined by this algorithm and may beassociated with complex calculations. Finally, all parents and ancestorsof the simple sum nodes are included in the final dimension hierarchythat is published.

Simple sum nodes are those nodes whose associated calculation is asimple sum operating on nodes that are entirely independent of any othernodes in the hierarchy. Non-simple sums may include, for example, thosenodes whose associated calculation is anything other than a simple sum,such as a multiplication. Examples of such non-simple sums areillustrated with respect to FIGS. 9A-9B and 10A-10B. Another example ofa non-simple sum may include those nodes whose associated calculationoperates on a node that is not entirely independent of another node inthe hierarchy. One example of such a non-simple sum is illustrated withrespect to FIGS. 8A-8B.

In one embodiment, to discard the non-simple sum nodes, the followingrules may be applied. First, if a node has more than one parent, assignparentage to the first parent in the model order, and movesub-hierarchies of the second parent in the hierarchy order to the root.This will result in the second parent in the hierarchy order becoming aleaf node. Second, if a node has an associated calculation that is a notsimple summary, move all sub-hierarchies of that node to the root. Thiswill result in the non-simple sum node becoming a leaf.

In the examples shown in FIGS. 7A-10B, nodes on an enterprise planningmodel are illustrated. The nodes are represented by letter/numbercombinations and the relationships between items are drawn as lines.Parent nodes represent items that are calculated from other nodes.Parent nodes have their example values shown in parentheses, such as“(3)” for node [B1] in FIG. 7A. Leaf nodes have 2 values displayed, suchas “(5,0)” for node [D1] in FIG. 7A. The first item in parentheses isthe node value, and the second item is a “leaf code.” Schema generator30 utilizes the leaf codes to describe information that was lost whenthe dimension hierarchy was reorganized in order to guarantee that thehierarchy will aggregate the correct totals represented within the datacube.

In the example embodiments shown in FIGS. 7A-10B, leaf codes are asfollows:

0=Direct child of a simple sum node,

1=The leaf has multiple parents,

2=The leaf item is part of a sub-hierarchy which has been moved to theroot, and

3=The leaf item is an orphan or had no parent.

FIG. 7A represent an example dimension hierarchy defined within a datacube prior to publication. In this example, all of the nodes of thedimension hierarchy are defined by simple summaries. Specifically, therelationships between the nodes in FIG. 7A are described within the datacube by the following equations.A1=B1+D1+G1,B1=C1+E1,G1=H1+P1, andP1=S1+T1.

FIG. 7B depicts the simple summary hierarchy 122 produced by schemagenerator during publication for storage within simple sums table 72.The leaf node values and leaf codes of simple sum hierarchy 122 areshown in parenthesis. Note that because every node in hierarchy 120 isalready described by a simple summary, each leaf node [C1], [E1], [D1],[H1], [S1], and [T1] has a leaf code equal to 0. No reorganization ofhierarchy 120 was required in this case because each node is defined asa simple summary.

FIG. 8A shows an example of a dimension hierarchy 126 including a leafnode with multiple parents prior to publication. In this example, therelationships between the nodes in FIG. 8A are described by thefollowing equations.A2=B2+D2+G2,B2=C2+E2,D2=E2+F2,G2=H2+P2, andP2=S2+T2.

In this example, leaf node [E2] has more than one parent. In that case,schema generator 30 assigns parentage to the first parent in modelorder. In one embodiment, schema generator 30 reorganizes hierarchy 126such that parent node [D2] becomes a leaf node and node [F2] becomesorphaned and is moved to the root. FIG. 8B depicts the resulting simplesummary hierarchy 128 produced by schema generator during publicationfor storage within simple sums table 72.

FIG. 9A shows an example of a dimension hierarchy 130 including anon-simple summary. The relationships between the nodes in FIG. 9A aredescribed by the following equations.A3=B3+D3+G3,B3=C3+E3,G3=H3+P3, andP3=S3*T3.

In the example shown in FIG. 9A, parent node [P3] is the product of leafnodes [S3] and [T3]. In one embodiment, schema generator 30 moves leafnodes of non-simple summaries to the root. Since node [P3] became a leafnode, node [S3] and [T3] were orphaned and moved to the root. FIG. 9Bdepicts the resulting simple summary hierarchy 132 produced by schemagenerator during publication for storage within simple sums table 72.

FIG. 10A shows an example of a dimension hierarchy 134 including asub-hierarchy 133 of a non-simple summary. The relationships between thenodes in FIG. 10A are described by the following equations.A4=B4+D4+G4,B4=C4*E4, andC4=S4+T4.

In this example, node [B4] is the product of node [C4] and [E4]. Node[C4] own simple summary hierarchy. FIG. 10B depicts the resulting simplesummary hierarchy 136 produced by schema generator during publicationfor storage within simple sums table 72. Since non simple sum nodescannot be parents, schema generator 30 reorganizes the hierarchy suchthat node [B4] becomes a leaf, node [E4] and [C4] become orphaned andmoved to the root. Node [C4] keeps its sub-hierarchy.

FIG. 11 shows a portion of an example fact table 68 created for apublished data cube. In the example of FIG. 11, fact table 68 containsan identifier column 140A-140C for each dimension other than thedimension selected for publication with this data cube. Thus, in theexample of FIG. 11, fact table 68 includes identifiers columns 140A-140Cfor an “employee” dimension, an “elist” dimension, and a “versions”dimension.

Fact table 68 also contains one or more columns for each measure of thecube, depending on which columns are selected by user 12A. Inparticular, fact table 68 contains columns to support the data typesrequested by user 12A. In this example, three columns are used for eachmeasure. Consequently, schema generator 30 generates fact table 68 toinclude column 142A-142C to store data for the “grade” measure in text,float, and date format, respectively. Similarly, multiple columns areused to store data for a “base salary” measure and other measures,although FIG. 11 depicts only a portion of fact table 68 for ease ofillustration.

FIG. 12 is a flowchart illustrating in further detail an exemplaryprocess of generating base reporting model 40 and user reporting model38. Initially, a user, such as user 12A, interacts with model generator32 to identify a relational database storing one or more publicationschemas, such as database schema 36 (150).

Next, user 12A provides general information required to execute themodel creation process, such as connection details for connecting withreporting tool 34 and a location to store base reporting model 40 anduser reporting model 38 once created (152).

Model generator 32 then presents user 12A with a list of databaseschemas (e.g., star schemas) that were previously published as describedabove in the selected relational database (154). Model generator 32receives input from user 12A selecting one or more of the databaseschemas for inclusion in generating the reporting models, i.e., basereporting model 40 and user reporting model 38 (156).

Next, user 12A selects particular dimension information for use in thereporting models (158). In particular, if the multidimensional datastored in the selected database schemas can be recreated using differentviews of the dimension, model generator 32 allows user 12 to select oneor more of the views for each data cube for use in reporting models.

Model generator 32 automatically interprets the metadata stored withinthe selected database schemas and outputs a base reporting model 40based on the selected dimensional information (160) and a user reportingmodel 38 by importing the definitions in the base reporting model 40.Finally, model generator 32 may invoke reporting tool 32 to create oneor more default reports for each database schema selected by the userwhen generating the reporting models (162). These reports can serve asthe basis for authoring more advanced reports.

FIG. 13 is a flowchart illustrating in further detail an exemplaryprocess of updating user reporting model 38. Initially, model generator32 imports the definitions from base reporting model 40 into userreporting model 38 (180), and allows a user, such as user 12A, to modifythe user reporting model 38 (181). For example, user 12A may renameelements of the model, remove elements or move elements within the userreporting model 38. Model generator 32 captures the changes made to userreporting model 38 and maintains them in activity log 22. Moreover,model generator 32 maintains activity log 22 to record the particularchanges and the order in which the changes were applied (182).

In addition, model generator 32 allows user 12A to modify the metadatacontained in the underling database schema 36 used in generating thereporting model (183). For example, user 12A may select additional datacubes, may add members or levels to dimensions or may change thesecurity settings applied to the published multidimensional data.

Once the changes have been made, model generator 32 loads the initialselections made by the user as described in reference to FIG. 12 (184).In particular, model generator 32 loads the dimension selectionsoriginally used to generate the reporting models.

Next, model generator 32 deletes the old base reporting model 40 andregenerates the base reporting model based on the modified metadatawithin database schema 36 (186). Model generator 32 then synchronizesuser reporting model 38 (188). In particular, model generator 32 deletesthe user reporting model and recreates the user reporting model from thenewly generated base reporting model 40. Model generator 32 thenre-applies the changes recorded in activity log 22 (190) to restore theprevious enhancements made to user reporting model 38 by user 12A. Modelgenerator 32 then updates default reports based on the updated userreporting model (192).

In this manner, model generator 32 preserves enhancements to a userreporting model 38 while allowing the user to modify the underlyingdatabase schema 36 containing published multidimensional data. Reportingtool 34 utilizes user reporting model 38 to generate reports 17.

In one embodiment, model generator 32 creates base reporting model 40 asa plurality of extensible markup language (XML) files. Model generator32 generates a set of folders in the reporting model to storedefinitions that describe a “physical view” of database schema 36. Inparticular, the first set of folders contains information describingeach relational table of database schema 36, including fact tables 68and dimension tables 62. The second set of folders contains definitionsthat describe a “business view” and contains information describing therelationship between the tables. In particular, the second set offolders contains definitions for each star schema associated with eachfact table within the database schemas selected for the report model.When generating base reporting model 40 model generator 32 includesdefinitions that specify the relationship between the fact tables andthe dimension tables and, in particular, the primary and foreign keysand the cardinality of the relationships.

Further, when creating base reporting model 40, model generator 32generates the definitions to define objects for the relational tablesand columns of database schema 36. Model generator 32 utilizes thestored metadata to ensure that the names assigned to the tables andcolumns in the reporting model are the same as in the model stored inthe enterprise planning system. Model generator 32 generates definitionsfor columns which specify the usage (e.g., data types, attributes,identifiers) for each column within database schema.

Model generator 32 also includes definitions within the reporting modelthat allows reporting tool 34 to understand the hierarchical structureof the dimensions described by the various dimension tables 62 (e.g.,item tables 70, simple sum tables 72 and calculated hierarchy tables74). As described above, this may be used by reporting tool 34 when user12A wants to aggregate and summarize values in one or more of reports17. These definitions are used by the reporting tool to makeaggregations at the correct level when shared dimensions are used.

FIG. 14 is a screen illustration of an example user interface 200 withwhich a user, such as user 12A, interacts to initiate publication ofmultidimensional data. FIG. 15 is a screen illustration of an exampleuser interface 202 with which user 12A selects one or more dimensions(“D-List”) for any of the available data cubes (“D-Cubes”).

FIG. 16 is a screen illustration of an example user interface 204 withwhich user 12A may select option 206 to direct schema generator 30 tocreate all columns necessary to support the data types for the dimensionbeing published. Alternatively, user 12A may select one or more of datatypes 208 to direct schema generator 30 to only create columns for thespecified data types.

FIG. 17 is a screen illustration of an example user interface 210produced by schema generator 30 to display the columns and theircorresponding column name and column data types which will be createdwhen the database schema 36 is created.

FIG. 18 is a screen illustration of an example user interface 214 ofmodel generator 32. As illustrated, user interface 212 provides a listof models. For each model, user interface 214 displays the correspondingfolders (e.g., physical view folder, business view folder and allsub-folders) as well as the contained object definitions such as tabledefinitions.

FIG. 19 is a screen illustration of an example user interface 216 ofmodel generator 32 by which user 12A selects data cubes from databasescheme 36 to utilize when generating base reporting model 40 and userreporting model 38.

FIG. 20 is a screen illustration of an example user interface 218 ofmodel generator 32 by which user 12A selects the type of dimensionhierarchy information to include in base reporting model 40 and userreporting model 38. In particular, user interface 218 allows user 12A toselect one or more: (1) unformatted lists form items tables 70, (2)derived hierarchy lists from simple sum tables 72, or (3) calculatedhierarchy lists from calculated hierarchy tables 74.

FIG. 21 is a screen illustration of an example user interface 220 ofmodel generator 32 by which user 12A initiates a synchronization processafter changing database schema 36 to recreate user reporting model 38based on activity log 22. As illustrated, user interface 212 provides alist of activity logs selectable by the users. For each activity log,user interface 212 lists the particular modifications that have beenrecorded and the order in which the modifications occurred. In thismanner, user 12A is able to view the modifications that would be“re-applied” after the synchronization of the user reporting model 38with the base reporting model 40.

Various embodiments of the invention have been described. Althoughdescribed in reference to an enterprise planning system, such as anenterprise financial or budget planning system, the techniques may bereadily applied to other software systems, including other large-scaleenterprise software systems. Examples of other enterprise softwaresystems include order management systems, inventory management systems,sales force management systems, business intelligent tools, enterprisereporting tools, project and resource management systems and otherenterprise software systems. Moreover, the techniques may be implementedon any type of computing device, including servers, client computers,laptops or other devices. These and other embodiments are within thescope of the following claims.

1. A computer-implemented system comprising: a data store having multidimensional data, wherein the multidimensional data includes a data cube having a dimension; and an executable software module that accesses the data store and generates hierarchy data describing a hierarchy of the dimension of the data cube, wherein the software module generates hierarchy data so that the hierarchy data is guaranteed to aggregate, using only summations, to totals represented within the data cube.
 2. The system of claim 1, wherein the software module generates the hierarchy data as a simple sum table of a database schema.
 3. The system of claim 1, further comprising a reporting tool that outputs a report based on the hierarchy data.
 4. The system of claim 1, wherein the dimension includes a plurality of hierarchically arranged members, and wherein the software module outputs the hierarchy data to describe the hierarchical relationship of the members.
 5. The system of claim 4, wherein software module generates the hierarchy data to include less than all of the members of the dimension.
 6. The system of claim 1, wherein the software module generates the hierarchy data to include a set of nodes that describe parent-child relationships between the members of the dimension.
 7. The system of claim 6, wherein the software module analyzes mathematical expressions defined by the members to identify parent members that refer to child members, and generates the hierarchy data to represent the parent members and the child members.
 8. The system of claim 6, wherein for each member of the dimension, the software module determines whether a parent member exists within the dimension that defines a sum that refers to the member as an input to the sum.
 9. The system of claim 6, wherein when the software modules identifies multiple parent members within the dimension for a child member, the software module generates the hierarchy data by selecting one of the identified parent members as the parent member for the child member and designates the remaining parent members as leaf nodes within the hierarchy data.
 10. The system of claim 6, wherein when the software modules identifies no parent members for a member of the dimension, the software module designates the member as a root node within the hierarchy data.
 11. The system of claim 1, wherein the software module generates the hierarchy data to include a hierarchy of nodes that represent members of the dimension, wherein the nodes include a set of sum nodes associated with calculations that are only summations of other nodes and a set of root nodes associated with calculations other than only summations.
 12. A method comprising: storing multidimensional data that includes a dimension having hierarchical members; and generating hierarchy data that describes the hierarchical members of the dimension in a form that is guaranteed to aggregate, using only summations, to totals represented within the data cube.
 13. The method of claim 12, wherein generating hierarchy data comprises generating a table of a relational database schema, wherein each row of the simple sum table represents a complete path from a root member of the dimension to a leaf member.
 14. The method of claim 12, further comprising outputting a report based on the hierarchy data.
 15. The method of claim 12, wherein generating hierarchy data comprises generating the hierarchy data to include less than all of the members of the dimension.
 16. The method of claim 12, wherein generating hierarchy data comprises generating the hierarchy data to include a set of nodes that describe parent-child relationships between the members of the dimension.
 17. The method of claim 16, wherein generating hierarchy data comprises: analyzing mathematical expressions defined by the members to identify parent members that refer to child members, and generating the hierarchy data to represent the relationships between the parent members and the child members.
 18. The method of claim 16, wherein analyzing mathematical expressions includes determining for each member whether a parent member exists within the dimension that defines a sum that refers to the member as an input to the sum.
 19. The method of claim 16, further comprising: determining whether a child member has multiple parent members; and selecting one of the identified multiple parent members as the parent member for the child member; and designates the remaining parent members as leaf nodes within the hierarchy data.
 20. The method of claim 16, further comprising: identifying a set of the members for which no parent members can be identified; and designates the members as root nodes within the hierarchy data.
 21. The method of claim 12, wherein generating hierarchy data comprises generating the hierarchy data to include a hierarchy of nodes that represent members of the dimension, wherein the nodes include a set of sum nodes associated with calculations that are summations of other nodes and a set of root nodes associated with calculations other than summations.
 22. A computer-readable medium having instructions that cause a programmable processor to: access multidimensional data that includes a dimension having hierarchical members; and generate hierarchy data that describes the hierarchical members of the dimension, wherein the hierarchy data defines a hierarchy of nodes that represent members of the dimension, wherein the nodes includes a set of sum nodes associated with calculations that are summations of other nodes and a set of root nodes associated with calculations other than only summations. 